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Abstract 

A graph property is monotone if it is closed under removal of vertices and edges. In this 
paper we consider the following algorithmic problem, called the edge-deletion problem; given a 
monotone property V and a graph G, compute the smallest number of edge deletions that are 
needed in order to turn G into a graph satisfying V. We denote this quantity by E!p{G). The 
first result of this paper states that the edge-deletion problem can be efficiently approximated for 
any monotone property. 

• For any fixed e > and any monotone property V, there is a deterministic algorithm, which 
given a graph G — {V,E) of size n, approximates E!p{G) in linear time 0{\V\ + \E\) to 
within an additive error of en^. 

Given the above, a natural question is for which monotone properties one can obtain better 
additive approximations of E!p . Our second main result essentially resolves this problem by giving 
a precise characterization of the monotone graph properties for which such approximations exist. 

(1) If there is a bipartite graph that does not satisfy V, then there is a (5 > for which it is 
possible to approximate E!p to within an additive error of n^^^ in polynomial time. 

(2) On the other hand, if all bipartite graphs satisfy V, then for any (5 > it is A^P-hard to 
approximate E!p to within an additive error of n^^^ . 

While the proof of (1) is relatively simple, the proof of (2) requires several new ideas and involves 
tools from Extremal Graph Theory together with spectral techniques. Interestingly, prior to this 
work it was not even known that computing E!p precisely for the properties in (2) is A^P-hard. 
We thus answer (in a strong form) a question of Yannakakis, who asked in 1981 if it is possible 
to find a large and natural family of graph properties for which computing E!p is A^P-hard. 
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1 Introduction 



1.1 Definitions, background and motivation 

The topic of this paper is graph modification problems, namely problems of the type: "given a graph 
G, find the smallest number of modifications that are needed in order to turn G into a graph satisfying 
property V" . The main two types of such problems are the following, in node modification problems, 
one tries to find the smallest set of vertices, whose removal turns G into a graph satisfying V, while 
in edge modification problems, one tries to find the smallest number of edge deletions/additions that 
turn G into a graph satisfying V. In this paper we will focus on edge-modification problems. Before 
continuing with the introduction we need to introduce some notations. 

For a graph property V, let Vn denote the set of graphs on n vertices which satisfy V. Given two 
graphs on n vertices, G and G' , we denote by A{G,G') the edit distance between G and G' , namely 
the smallest number of edge additions and/or deletions that are needed in order to turn G into G' . 
For a given property V, we want to denote how far is a graph G from satisfying V. For notational 
reasons it will be more convenient to normalize this measure so that it is always in the interval [0, 1] 
(actually [0, ^]). We thus define 

Definition 1.1 {E-p{G)) For a graph property V and a graph G on n vertices, let 

. AiG,G') 
E-piG) = mm . 

In words, E'p{G) is the minimum edit distance of G to a graph satisfying V after normalizing it by 
a factor of n"^. 

Graph modification problems are well studied computational problems. In 1979, Garey and John- 
son [28] mentioned 18 types of vertex and edge modification problems. Graph modification problems 
were extensively studied as these problems have applications in several fields, including Molecular 
Biology and Numerical Algebra. In these applications a graph is used to model experimental data, 
where edge modifications correspond to correcting errors in the data: Adding an edge means cor- 
recting a false negative, while deleting an edge means correcting a false positive. Computing E'p[G) 
for appropriately defined properties V have important applications in physical mapping of DNA (see 
|17j . |29j and [31]). Computing E-p{G) for other properties arises when optimizing the running time 
of performing Gaussian elimination on a sparse symmetric positive-definite matrix (see |42j ) . Other 
modification problems arise as subroutines for heuristic algorithms for computing the largest clique 
in a graph (see [38]). Some edge modification problems also arise naturally in optimization of circuit 
design ^18j. Wc briefiy mention that there are also many results about i;eriea; modification problems, 
notably that of Lewis and Yannakakis [38], who proved that for any nontrivial hereditary property 
V, it is A^P-hard to compute the smallest number of vertex deletions that turn a graph into one 
satisfying V. (A graph property is hereditary if it is closed under removal of vertices.) 

A graph property is said to be monotone if it is closed under removal of both vertices and edges. 
Examples of well studied monotone properties are A:-colorability, and the property of being H-fiee 
for some fixed graph H. (A graph is H-iree if it contains no copy of as a not necessarily induced 
subgraph.) Note, that when trying to turn a graph into one satisfying a monotone property we 
will only use edge deletions. Therefore, in these cases the problem is sometimes called edge-deletion 
problem. Our main results, presented in the following subsections, give a nearly complete answer to 
the hardness of additive approximations of the edge-deletion problem for monotone properties. 
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1.2 An algorithm for any monotone property 



Our first main result in this paper states that for any graph property V that belongs to the large, 
natural and well studied family of monotone graph properties, it is possible to derive efficient ap- 
proximations of E-p. 

Theorem 1.1 For any fixed e > and any monotone property V there is a deterministic algorithm 
that given a graph G on n vertices computes in time 0{n^) a real E satisfying \E — E'p{G)\ < e. 

Note, that the running time of our algorithm is of type f{e)n^, and can in fact be improved to linear 
in the size of the input by first counting the number of edges, taking E = Q m. case the graph has 
less than en^ edges. We note that Theorem 11.11 was not known for many monotone properties. In 
particular, such an approximation algorithm was not even known for the property of being triangle- 
free and more generally for the property of being H-hee for any non-bipartite H. 

Theorem II. II is obtained via a novel structural graph theoretic technique. One of the applications 
of this technique (roughly) yields that every graph G, can be approximated by a small weighted graph 
W, in such a way that Ep(G) is approximately the optimal solution of a certain related problem (ex- 
plained precisely in Section [3]) that we solve on W. The main usage of this new structural-technique 
in this paper is in proving Lemmas 13.41 and 13.51 that lie at the core of the proof of Theorem 11.11 
This new technique, which may very well have other algorithmic and graph-theoretic applications, 
applies a result of Alon, Fischer, Krivelevich and Szegedy ^ which is a strengthening of Szemeredi's 
Regularity Lemma |44j . We then use an efficient algorithmic version of the regularity lemma, which 
also implies an efficient algorithmic version of the result of [4], in order to transform the existential 
structural result into the algorithm stated in Theorem ll.il 

We further use our structural result in order to prove the following concentration-type result 
regarding the edit distance of subgraphs of a graph. 

Theorem 1.2 For every e and any monotone property V there is a d = d{e,V) with the following 
property: Let G be any graph and suppose we randomly pick a subset D, of d vertices from V{G) . 
Denote by G' the graph induced by G on D. Then, 

Prob[ lE-piG') - E'p{G)\ > e] < e . 

An immediate implication of the above theorem is the following, 

Corollary 1.2 For every e > and any monotone property V there is a randomized algorithm 
that given a graph G computes in time 0(1) a real E satisfying \E — E'p{G)\ < e with probability at 
least 1 — e. 

We stress that there are some computational subtleties regrading the implementation of the 
algorithmic results discussed above. Roughly speaking, one should define how the property V is 
"given" to the algorithm and also whether e is a fixed constant or part of the input. These issues 
are discussed in Section [5l 

It is natural to ask if the above results can be extended to the larger family of hereditary proper- 
ties, namely, properties closed under removal of vertices, but not necessarily under removal of edges. 
Many natural properties such as being Perfect, Chordal and Interval are hereditary non- monotone 
properties. By combining the ideas we used in order to prove Theorem 11.11 along with the main ideas 
of [6] it can be shown that Theorem ll.il (as well as Theorem 1 1 . 2 1 and Corollary II. 2p also hold for any 
hereditary graph property. 
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1.3 On the possibility of better approximations 

Theorem 11.11 implies that it is possible to efficiently approximate the distance of an n vertex graph 
from any monotone graph property V, to within an error of ev? for any e > 0. A natural question one 
can ask is for which monotone properties it is possible to improve the additive error to Tn?~^ for some 
fixed (5 > 0. In the terminology of Definition ll.il this means to approximate E-p to within an additive 
error of for some 5 > 0. Our second main result in this paper is a precise characterization of the 
monotone graph properties for which such a 5 > exist£]. 

Theorem 1.3 Let V he a monotone graph property. Then, 

1. If there is a bipartite graph that does not satisfy V , then there is a fixed (5 > for which it is 
possible to approximate E-p to within an additive error of in polynomial time. 

2. On the other hand, if all bipartite graphs satisfy V , then for any fixed 6 > it is NP-hard to 
approximate E-p to within an additive error of . 

While the first part of the above theorem follows easily from the known results about the Turan 
numbers of bipartite graphs (see, e.g., [45j), the proof of the second item involves various combinato- 
rial tools. These include Szemeredi's Regularity Lemma, and a new result in Extremal Graph Theory, 
which is stated in Theorem 16.11 (see Section [6|) that extends the main result of fl4] and [15]. We 
also use the basic approach of [T], which applies spectral techniques to obtain an A^P-hardness result 
by embedding a blow-up of a sparse instance to a problem, in an appropriate dense pseudo-random 
graph. Theorem 16.11 and the proof technique of Theorem 11.31 may be useful for other applications 
in graph theory and in proving hardness results. As in the case of Theorem ll.il the second part of 
Theorem 11.3! was not known for many specific monotone properties. For example, prior to this paper 
it was not even known that it is A^P-hard to precisely compute E-p, where V is the property of being 
triangle-free. More generally, such a result was not known for the property of being if-free for any 
non-bipartite H. 

1.4 Related work 

Our main results form a natural continuation and extension of several research paths that have been 
extensively studied. Below we survey some of them. 

1.4.1 Approximations of graph- modification problems 

As we have previously mentioned many practical optimization problems in various research areas 
can be posed as the problem of computing the edit-distance of a certain graph from satisfying a 
certain property. Cai [16] has shown that for any hereditary property, which is expressible by a 
finite number of forbidden induced subgraphs, the problem of computing the edit distance is fixed- 
parameter tractable. Khot and Raman [33j proved that for some hereditary properties V, finding in 
a given graph G, a subgraph that satisfies V is fixed-parameter tractable, while for other properties 
finding such a subgraph is hard in an appropriate sense (see [33J). 

Note that Theorem 11.1! implies that if the edit distance (in our case, number of edge removals) of 
a graph from a property is i}{n'^), then it can be approximated to within any multiplicative constant 
1 + e. 

^We assume henceforth that V is not satisfied by all graphs. 
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1.4.2 Hardness of edge-modification problems 

Natanzon, Shamir and Sharan ^39j proved that for various hereditary properties, such as being 
Perfect and Comparabihty, computing Ep is A^P-hard and sometimes even A^P-hard to approximate 
to within some constant. Yannakakis 06] has shown that for several graph properties such as 
outerplanar, transitively orientable, and line-invertible, computing E-p is A^P-hard. Asano [12] and 
Asano and Hirata [13j have shown that properties expressible in terms of certain families of forbidden 
minors or topological minors are A^P-hard. 

The A^P-completeness proofs obtained by Yannakakis in |46j . were add-hoc arguments that ap- 
plied only to specific properties. Yannakakis posed in [36] as an open problem, the possibility of 
proving a general A^P-hardness result for computing E-p that will apply to a general family of graph 
properties. Theorem 11.31 achieves such a result even for the seemingly easier problem of approximat- 
ing Ep. 

1.4.3 Approximation schemes for "dense" instances 

Fernandez de la Vega [22j and Arora, Karger and Karpinski llj showed that many of the classi- 
cal A^P-complete problems such as MAX-CUT and MAX-3-CNF have a PTAS when the instance 
is dense, namely if the graph has Q{n?) edges or the 3-CNF formula has r2(n^) clauses. Approxi- 
mations for dense instances of Quadratic Assignment Problems, as well as for additional problems, 
were obtained by Arora, Frieze and Kaplan [T0|. Frieze and Kannan [!26j obtained approximations 
schemes for several dense graph theoretic problems via certain matrix approximations. Alon, Fer- 
nandez de la Vega, Kannan and Karpinski [3j obtained results analogous to ours for any dense 
Constraint-Satisfaction-Problem via certain sampling techniques. It should be noted that all the 
above approximation schemes are obtained in a way similar to ours, that is, by first proving an addi- 
tive approximation, and then arguing that in case the optimal solution is large (that is, O(n^) in case 
of graphs, or Q{n^) in case of 3-CNF) the small additive error translates into a small multiplicative 
error. 

All the above approximation results apply to the family of so called Constraint-Satisfaction- 
Problems. In some sense, these problems can express graph properties for which one imposes re- 
strictions on pairs of vertices, such as A;-colorability. These techniques thus fall short from applying 
to properties as simple as Triangle- freeness, where the restriction is on triples of vertices. The tech- 
niques we develop in order to obtain Theorem 11.11 enable us to handle restrictions that apply to 
arbitrarily large sets of vertices. 

We briefly mention that Ep is related to packing problems of graphs. In [32] and [17] it was 
shown that by using linear programming one can approximate the packing number of a graph. In 
Section [9] we explain why this technique does not allow one to approximate Ep. 

1.4.4 Algorithmic applications of Szemeredi's Regularity Lemma 

The authors of [2] gave a polynomial time algorithmic version of Szemeredi's Regularity Lemma. 
They used it to prove that Theorem 11.11 holds for the A:-color ability property. The running time of 
their algorithm was improved by Kohayakawa, Rodl and Thoma [3l] . Frieze and Kannan [25] further 
used the algorithmic version of the regularity lemma, to obtain approximation schemes for additional 
graph problems. 

Theorem 11.11 is obtained via the algorithmic version of a strengthening of the standard regularity 
lemma, which was proved in [4] , and it seems that these results cannot be obtained using the standard 
regularity lemma. 
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1.4.5 Tolerant Property- Testing 

In standard Property- Testing (see [23] and [Hj) one wants to distinguisli between the graplis G tliat 
satisfy a certain graph property V, or equivalently those G for which E-p(G) = 0, from those that 
satisfy E-piG) > e. The main goal in designing property-testers is to reduce their query-complexity, 
namely, minimize the number of queries of the form "are i and j connected in the input graphs?". 

Parnas, Ron and Rubinfeld ^40j introduced the notion of Tolerant Property- Testing, where one 
wants to distinguish between the graphs G that satisfy E-p(G) < 5 from those that satisfy E-p{G) > e, 
where < 5 < e < 1 are some constants. Recently, there have been several results in this line of 
work. Specifically, Fischer and Newman [2l] have recently shown that if a graph property is testable 
with number of queries depending on e only, then it is also tolerantly testable for any < 5 < e < 1 
and with query complexity depending on |e — (5|. Combining this with the main result of [7J implies 
that any monotone property is tolerantly testable for any < 5 < e < 1 and with query complexity 
depending on |e — Note, that Corollary 11.21 implicitly states the same. In fact, the algorithm 
implied by Corollarv 11.21 is the "natural" one, where one picks a random subset of vertices S, and 
approximates E-p{G) by computing E-p on the graph induced by S. The algorithm of [23] is far more 
complicated. Furthermore, due to the nature of our algorithm if the input graph satisfies a monotone 
property V, namely if E-p^G) = 0, we will always detect that this is the case. The algorithm of [23] 
may declare that Ep{G) > even if Ep{G) = 0. 

1.5 Organization 

The proofs of the main results of this paper. Theorems 11.11 and 11.31 are independent of each other. 
Sections [21 [3l |31 and [5] contain the proofs relevant to Theorem 11.11 and Sections [U [7] and [8] contain 
the proofs relevant to Theorem 11.31 

In Section [2] we introduce the basic notions of regularity and state the regularity lemmas that we 
use for proving Theorem 11.11 and some of their standard consequences. In Section [3] we give a high 
level description of the main ideas behind our algorithms. We also state the main structural graph 
theoretic lemmas. Lemmas 13.41 and 13.51 that lie at the core of these algorithms. The proofs of these 
lemmas appear in section |4l In Section [5] we give the proof of Theorems 11.11 and 11.21 as well as a 
discussion about some subtleties regarding the implementation of these algorithms. 

Section [6] contains a high-level description of the proof of Theorem 11.31 as well as a description of 
the main tools that we apply in this proof. In Section [7] we prove a new Extremal Graph-Theoretic 
result that lies at the core of the proof of Theorem 11.31 In Section [8] we give the detailed proof of 
Theorem 11.31 

The final Section [9] contains some concluding remarks and open problems. Throughout the 
paper, whenever we relate, for example, to a function /3.1, we mean the function / defined in 
Lemma/Claim/Theorem 3.1. 

2 Regularity Lemmas and their Algorithmic Versions 

In this section we discuss the basic notions of regularity, some of the basic applications of regular 
partitions and state the regularity lemmas that we use in the proof of Theorems 11.11 and II. 2[ See 
[35| for a comprehensive survey on the regularity- lemma. We start with some basic definitions. For 
every two nonempty disjoint vertex sets A and B oi a graph G, we define e{A^ B) to be the number 
of edges of G between A and B. The edge density of the pair is defined by B) = e{A, B)/\A\\B\. 
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Definition 2.1 (7-regular pair) A pair {A,B) is 7-regular, if for any two subsets A' <^ A and 
B' C B, satisfying \A'\ > -f\A\ and \B'\ > ^\B\, the inequality \d{A' , B') - d{A, B)\ < 7 holds. 

Throughout the paper we will make an extensive use of the notion of graph homomorphism which 
we turn to formally define. 

Definition 2.2 (Homomorphism) A homomorphism from a graph F to a graph K, is a mapping 
ip : V{F) ^ V{K) that maps edges to edges, namely {v,u) £ E{F) implies {(p{v) , ip{u)) G E{K). 

In what follows, F ^ K denotes the fact that there is a homomorphism from F to K. We will 
also say that a graph H is homomorphic to K \i H ^ K. Note, that a graph H is homomorphic to 
a complete graph of size k if and only if H is A;-colorable. 

Let F be a graph on / vertices and K a graph on k vertices, and suppose F ^ K. Let G be a 
graph obtained by taking a copy of K, replacing every vertex with a sufficiently large independent 
set, and every edge with a random bipartite graph of edge density d. It is easy to show that with 
high probability, G contains a copy of F (in fact, many). The following lemma shows that in order 
to infer that G contains a copy of F, it is enough to replace every edge with a "regular enough" pair. 
Intuitively, the larger / and k are, and the sparser the regular pairs are, the more regular we need 
each pair to be, because we need the graph to be "closer" to a random graph. This is formulated 
in the lemma below. Several versions of this lemma were previously proved in papers using the 
regularity lemma (see [35]). 

Lemma 2.3 For every real < r/ < 1, and integers k,f > 1 there exist 7 = " |g]g| (T/, k, /), and 
N = A j^g]g| (7/, k, f) with the following property. Let F be any graph on f vertices, and let Ui, . . . ,Uk 
be k pairwise disjoint sets of vertices in a graph G, where \Ui\ = ... = \Uk\ > N. Suppose there 
is a mapping ip : V{F) 1— > {1, . . . ,k} such that the following holds: If {i,j) is an edge of F then 
(^</3(j)) ^ip(j)) ^-regular with density at least rj. Then Ui, . . . ,Uk span a copy of F . 

Comment 2.4 Observe that the function ' jg]g| (7/, k, f) may and will be assumed to be monotone 
non-increasing in k and f and monotone non- decreasing in rj. Therefore, it will be convenient 
to assume that " J2^ Vj ^1 /) ^ v'^ ■ Similarly, we will assume that A jgyg| (7y, k, f) is monotone non- 
decreasing in k and f. Also, for ease of future definitions (in particular those given in set 
n[2^r], k, 0) = A[0|(r/, A;, 0) = 1 for any k > 1 and < r] < I. 

A partition A = {Vi \ 1 < i < k} of the vertex set of a graph is called an equipartition ii \ Vi\ and 
I Vj I differ by no more than 1 for all 1 < i < j < A; (so in particular each Vi has one of two possible 
sizes). The order of an equipartition denotes the number of partition classes {k above). A refinement 
of an equipartition A is an equipartition of the form B = {Vij | 1 < i < fc, 1 < j < 1} such that Vij 
is a subset of Vi for every 1 < i < fc and 1 < j < /. 

Definition 2.5 (7-regular equipartition) An equipartition B = {Vi | 1 < i < A;} 0/ the vertex set 
of a graph is called ^-regular if all but at most 7(2) of the pairs {Vi, Vi') are ^-regular. 

The Regularity Lemma of Szemeredi can be formulated as follows. 

Lemma 2.6 ( [44] ) For every m and 7 > there exists T = 2 [g]'g| (m, 7) with the following property: 
If G is a graph with n >T vertices, and A is an equipartition of the vertex set of G of order at most 
m, then there exists a refinement B of A of order k, where m <k <T and B is ^-regular. 
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2^^(m, 7) may and is assumed to be monotone non-decreasing in m and monotone non-increasing 
in 7. Szemeredi's original proof of Lemma [2. 6 1 was only existential as it supplied no efficient algorithm 
for obtaining the required equipartition. Alon et. al. [2] were the first to obtain a polynomial time 
algorithm for finding the equipartition, whose existence is guaranteed by lemma [2^ The running 
time of this algorithm was improved by Kohayakawa et. al. [M] who obtained the following result. 

Lemma 2.7 ([34]) For every fixed m and 7 there is an 0{n'^) time algorithm that given an equipar- 
tition A finds equipartition B as in Lemma \2.6l 

Our main tool in the proof of Theorem 11.11 is Lemma 12.91 below, proved in [4]. This lemma can 
be considered a strengthening of Lemma 12.61 as it guarantees the existence of an equipartition and 
a refinement of this equipartition that poses stronger properties compared to those of the standard 
7-regular equipartition. This stronger notion is defined below. 

Definition 2.8 (iS-regular equipartition) For a function £{r) : N ^ (0, 1), a pair of equiparti- 
tions A = {Vi I 1 <i <k] and its refinement B = {V^j | 1 < i < fe, ^ < j where Vij C Vi for 

all i,j, are said to be £ -regular if 

1. For all 1 < i < i' < k, for all 1 < j,j' < / but at most £{k)l'^ of them, the pair (V^j, V^'j/) is 
£{k) -regular. 

2. All 1 < i < i' < k but at most £{0)[2) of them are such that for all 1 < < I but at most 
£{0)l^ of them \d{Vi, Vi') - d{Vij, V^j')] < £{0) holds. 

It will be very important for what follows to observe that in Definition 12 . 81 we may use an arbitrary 
function rather than a fixed 7 as in Definition 12.51 (such functions will be denoted by £ throughout 
the paper). The following is one of the main results of ^4J. 

Lemma 2.9 ([4]) For any integer m and function £{r) : N ^ (0, 1) there is S = 3^2^ ^^-, £) such 
that any graph on at least S vertices has an £-regular equipartition A, B where \A\ = k > m and 
\B\ = kl<S. 

In order to make the presentation self contained we briefiy review the proof of Lemma 12. 9[ Fix 
any m and function £ and put = £{0). Partition G into m arbitrary subsets of equal size and 
denote this equipartition by Aq. Put M = m. Iterate the following task: Apply Lemma 12.61 on Ai-i 
with m = |.Aj_i| and 7 = £{M)/M'^ and let Ai be the refinement of Ai-i returned by Lemma 12.61 
If Ai-i and Ai form an <S-regular equipartition stop, otherwise set M = |.Ai_i| and reiterate. It is 
shown is [4j that after at most 100/C^ iterations, for some 1 < i < 100/^^ the partitions Ai-i and Ai 
form an iS-regular equipartition. Moreover, detecting an i for which this holds is very easy, that is, 
can be done in time O(n^) (see the proof in [3|). Note, that one can thus set the integer 9^^m, £) to 
be the order of Ai. In particular, the following is an immediate implication of the above discussion. 

Proposition 2.10 Ifm is bounded by a function of e only, then for any £ the integer S = S^2^ 'm, £) 
can be upper bounded by a function of e only. 

The e in the above proposition will be the e from the task of approximating E-p within an error 
of e in Theorem II. 11 Also, in our application of Lemma 12.91 the function £ will (implicitly) depend 
on €. For example, it will be convenient to set ^(0) = e. However, it follows from the definition of 
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!^^m,£) given above that even in this case it is possible to upper bound S^^m,£) by a function 
of e only. 

In order to design our algorithm we will need to obtain the equipartitions A and B that appear 
in the statement of Lemma [2.91 However, note that by the overview of the proof of Lemma [2.91 given 
above, in order to obtain this partition one can use Lemma [2. 71 as an efficient algorithm for obtaining 
the regular partitions. Moreover, by Proposition 12.101 whenever we apply either £ or Lemma 12.71 we 
are guaranteed that m (which in the above overview was M) is upper bounded by some function 
of e and 7 is lower bounded by some function of e. This means that each of the at most 100/^^ 
applications of Lemma |2. 101 takes O(n^) time. We thus get the following: 

Proposition 2.11 If m is bounded by a function of e only, then for any £ there is an 0{n'^) algorithm 
for obtaining the equipartitions A and B of Lemma \2.9[ 

3 Overview of the Proof of Theorem 11.11 

We start with a convenient way of handling a monotone graph property. 

Definition 3.1 (Forbidden Subgraphs) For a monotone graph property V, define T = T-p to be 
the set of graphs which are minimal with respect to not satisfying property V . In other words, a graph 
F belongs to J- if it does not satisfy V, but any graph obtained from F by removing an edge or a 
vertex, satisfies V . 

As an example of a family of forbidden subgraphs, consider V which is the property of being 2- 
colorable. Then T-p is the set of all odd-cycles. Clearly, a graph satisfies V if and only it contains no 
member of T-p as a (not necessarily induced) subgraph. We say that a graph is .F-free if it contains 
no (not necessarily induced) subgraph F ^ T . Clearly, for any family T ^ being .F-free is a monotone 
property. Thus, the monotone properties are precisely the graph properties that are equivalent to 
being .F-free for some family T . In order to simplify the notation, it will be simpler to talk about 
properties of type .F-free rather than monotone properties. To avoid confusion we will henceforth 
denote by Ejr{G) the value of Ep{G), where = T-p as above. 

The main idea we apply in order to obtain the algorithmic results of this paper is quite simple; 
given a graph G, a family of forbidden subgraphs T and e > we use Lemma [2. 91 with appropriately 
defined parameters in order to construct in 0(r?^ time a weighted complete graph W , of size de- 
pending on e but independent of the size of G, such that a solution of a certain "related" problem 
on W gives a good approximation of Ejr{G). As W will be of size independent of the size of G, we 
may and will use exhaustive search in order to solve the "related" problem on VF. In what follows 
we give further details on how to define W and the "related" problem that we solve on W. 

We start with the simplest case, where the property is that of being triangle-free, namely J- = 
{K3}. Let W be some weighted complete graph on k vertices and let < w{i,j) < 1 denote the 
weight of the edge connecting i and j in W. Let Ejr(W) be the natural extension of the definition 
of Ejr[G) to weighted graphs, namely, instead of just counting how many edges should be removed 
in order to turn G into an .F-free graph, we ask for the edge set of minimum weight with the above 
property. Let G be a /c-partite graph on n vertices with partition classes Vi, . . . , of equal size n/k. 
Suppose for every i < j we have d{Vi,Vj) = w{i,j) (recall that d{Vi,Vj) denotes the edge density 
between Vi and Vj). In some sense, W can be considered a weighted approximation of G, but to our 
investigation a more important question is whether W can be used in order to estimate Ejr(^G)7 In 
other words, is it true that Ejr(G) w Ejr{W)l 
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It is easy to see that Ejr{G) < Ejr(W). Indeed, given a set of edges S, whose removal turns W 
into a triangle free graph, we simply remove all edges connecting Vi and Vj for every G S. The 
main question is whether the other direction is also true. Namely, is it true that if it is possible to 
remove an^ from G and thus make it triangle free, then it is possible to remove from W a set of 
edges of total weight approximately ak'^ and thus make it triangle-free? If true this will mean that 
by computing Ejr(W) we also approximately compute Ejr{G). Unfortunately, this assertion is false 
in general, as the minimal number of edge modifications that are enough to make G triangle-free, 
may involve removing some and not all the edges connecting a pair (VJ, Vj), and in W we can remove 
only edges and not parts of them. It thus seems natural to ask what kind of restrictions should we 
impose on G (or more precisely on the pairs (V^, Vj)) such that the above situation will be impossible, 
namely, that the optimal way to turn G into a triangle free graph will involve removing either none 
or all the edges connecting a pair {Vi,Vj) (up to some small error). This will clearly imply that we 
also have Er{G) « Er{W). 

One natural restriction is that the pairs [Vi,Vj) would be random bipartite graphs. While this 
restriction indeed works it is of no use for our investigation as we are trying to design an algorithm 
that can handle arbitrary graphs and not necessarily random graphs. One is thus tempted to replace 
random bipartite graph with 7-regular pairs for some small enough 7. Unfortunately, we did not 
manage to prove that there is a small enough 7 > ensuring that even if all pairs {Vi^Vj) are 
7-regular then Ejri^G) ~ Ejr(yv). In order to circumvent this difficulty we use the stronger notion 
of iS-regularity defined in Section [2j As it turns out, if one uses an appropriately defined function 

then if all pairs iVi^Vj) are <f^(A;)-regular, one can infer that Ejr{G) EjriW). This result is 
(essentially) formulated in Lemma |3.4[ 

In the above discussion we considered the case J- = {-^^3}. So suppose now that J- is an arbitrary 
(possibly infinite) family of graph. Suppose we use a weighted complete graph W on k vertices as 
above in order to approximate some fc-partite graph. The question that naturally arises at this stage 
is what problem should we try to solve on W in order to get an approximation of Ejr[G). It is easy 
to see that G may be very far from being .F-free, while at the same time W can be ^-free, simply 
because J- does not contain graphs of size at most k. As an example, consider the case, where the 
property is that of containing no copy of the complete bipartite graph with two vertices in each 
side, denoted i^2,2- Now, if G is the complete bipartite graph -fC„/2.n/2 then it is very far from being 
i^2,2-free. However, in this case W is just an edge that spans no copy of i^2,2- 

It thus seems that we must solve a different problem on W . To formulate this problem we need 
the following definitions. 

Definition 3.2 (jF-homomorphism-free) For a family of graphs T , a graph W is called T- 
homomorphism-free if F ^ W for any F G JF. 

We now define a measure analogous to Ejr but with respect to making a graph .F-homomorphism- 
free. Note that we focus on weighted graphs. 

Definition 3.3 {T-ijr(yV)) For a family of graphs T and a weighted complete graph W on k vertices, 
let 7i'jr(W) denote the minimum total weight of a set of edges, whose removal from W turns it into 
an J^-homomorphism-free graph. Define, Tlj^{W) = Ti' j^{W) /k"^ . 

Note, that in Definition 13.21 the graph W is an unweighed not necessarily complete graph. Also, 
observe that when T = {i^s} then we have 7ij^(W) = Ejr(W). As it turns out, the "right" problem 
to solve on W is to compute Ti.j^{W). This is formulated in the following key lemma, whose proof 
appears in Section [JJ 



10 



Lemma 3.4 (The Key Lemma) For every family of graphs T , there are functions A jg^ A;, e) and 
e) with the following propert^: Let W he any weighted complete graph on k vertices and let 
(J be any k-partite graph with partition classes Vi, . . . ,Vk of equal size such that 

1- \n\ = ... = \Vk\>r^k,e). 

2. All pairs (Fj, Yj) are ^g^ fc, e)-regular. 

3. For every l<i<j<kwe have diVi, Vj) = w{i,j). 
Then, E^{G) > HriW) - e . 

It is easy to argue as we did above and prove that Ejr{G) < TLjriW) in Lemma [3^ (see the proof 
of Lemma l3.5p . however we will not need this (trivial) direction. It is important to note that while 
Lemma 13.41 is very strong as it allows us to approximate Ejr[G) via computing Ti.jr(W) (recall that 
W is intended to be very small compared to G) its main weakness is that it requires the regularity 
between each of the pairs to be a function of k, which denotes the number of partition classes, rather 
than depending solely on the family of graphs J^. We note that even if = {K^} as discussed above, 
we can only prove Lemma 13.41 with a regularity measure that depends on k. This supplies some 
explanation as to why Lemma 12.61 (the standard regularity lemma) is not sufficient for our purposes; 
note that the input to Lemma 12.61 is some fixed 7 > and the output is a 7-regular equipartition 
with number of partition classes that depends on 7 (the function 'L^^m, 7)). Thus, even if all pairs 
are 7-regular, this 7 may be very large when considering the number of partition classes returned by 
Lemma 12.61 and the regularity measure which Lemma 13.41 requires. Hence, the standard regularity 
lemma cannot help us with applying Lemma 13.41 In order to overcome this problem we use the 
notion of iS-regular partitions and the stronger regularity-lemma given in Lemma 12.9^ which, when 
appropriately used, allows us to apply Lemma 13.41 in order to obtain Lemma 13.51 below, from which 
Theorem 11.11 follows quite easily. The proof of this lemma appears in Section HI 

Lemma 3.5 For any e > and family of graphs T there are functions A^y^r) and j| (r) satisfying 
the following: Suppose a graph G has an S^^ r^ regular equipartition A = {Vi \ \ < i < k}, B = 
{^j' I 1 < ^ < ^, 1 < J < 0' where 

1. k> 1/e. 

2- ^ A jgyji (fc) for every 1 < i < k and 1 < j < l- 

Let W he a weighted complete graph on k vertices with w{i,j) = d{Vi,Vj). Then, 

\EAG)-nAw)\<e. 

Using the algorithmic version of Lemma 12.91 which is given in Proposition 12.111 we can rephrase 
the above lemma in a more algorithmic way, which is more or less the algorithm of Theorem ll.lt Given 
a graph G we use the 0{n^) time algorithm of Proposition 12. Ill in order to obtain the equipartition 
described in the statement of Lemma 13.51 We then construct the graph W as in Lemma 13.51 and 
finally use exhaustive search in order to precisely compute Tij^iW). By Lemma [3. 51 this gives a good 
approximation of Ejr[G). The proof of Theorem 1 1 . 1 1 app ears in Section El 

^The functions A j^^ fc, e) and '^ fg^ fc, e) will also (implicitly) depend on T. 
^The functions N^^r) and S^^r) will also (implicitly) depend on e and T. 
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4 Proofs of Lemmas 13.41 and 13.51 



In this section we apply our new structural technique in order to prove Lemmas 13.41 and 13.51 Re- 
gretfully, it is hard to precisely state what are the ingredients of this technique. Roughly speaking, 
it uses the notion of ^-regularity in order to partition the edges of a graph into a bounded number 
of edge sets, which have regular-partitions that are almost identicao and more importantly, the 
regularity-measure of each of the bipartite graphs in each of the edge sets can be a function of the 
number of clusters. 

We start this section with some definitions that will be very useful for the proof of Lemma 13.41 

Definition 4.1 For any (possibly infinite) family of graphs T , and any integer r let Tr be the 
following set of graphs: A graph R belongs to Tr if it has at most r vertices and there is at least one 
F £ such that F R. 

Definition 4.2 For any family of graphs T and integer r for which Tr 7^ 0, define 

^tW) = max min \V(F)\. (1) 

Define ^jr{r) =0 if J-r = Therefore, ^ jr{r) is monotone non- decreasing in r. 

Practicing definitions, note that if T is the family of odd cycles, then is precisely the family of 
non-bipartite graphs of size at most k. Also, in this case ^ jr{h) = k when k is odd, and ^:f{k) = k — 1 
when k is even. The "right" way to think of the function is the following: Let i? be a graph of 
size at most k and suppose we are guaranteed that there is a graph F' £ such that F' ^ R (thus 
R G J~k)- Then by this information only and without having to know the structure of R itself, the 
definition of ^'jr implies that there is a graph F £ T oi size at most ^'jr(fe), such that F ^ R. 

The function has a critical role in the proof of Lemma 13. 4[ While proving this lemma we will 
use Lemma 12.31 in order to derive that some k sets of vertices, which are regular enough, span some 
graph F £ T. Roughly speaking, the main difficulty will be that we will not know the size of F, 
and as a consequence will not know the regularity measure between these sets that is sufficient for 
applying Lemma [231 on these k sets (this quantity is " |2^ ??, k^ \V{F)\)). However, we will know that 
there is some F' £ which is spanned by these sets. The function ^j^{r) will thus be very useful 
as it supplies an upper bound for the size of the smallest F £ T which is spanned by these sets. See 
Proposition 14.41 where ^jr{r) has a crucial role. 

Proof of Lemma 13. 4t Given e and k let 

T = T{k, e) = T23](e/2, k, ^^(A;))). (2) 

We prove the lemma with ■- jg^ /c, e) and A jg^ fc, e) satisfying 

^A;,6)=min(e/2, l/T), (3) 



A^fc, 6) = T • %3](6/2, k, ^:F{k)) (4) 

Suppose G is a graph on n vertices, in which case each set Vi is of size ^. We may thus show 
that one must remove at least Tij^iW) ■ — en? edges from G in order to make it .7-"-free. To this 



Two regular partitions Vi, . . . , 14 and Ui, . . . ,Uk are identical if d{Vi, Vj) — d{Ui, Uj) 
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end, it is enough to show that if there is a graph G' that is obtained from G by removing less than 
Hjr(W) • r? — er? edges and spans tlo F ^ T then it is possible to remove from W a set of edges of 
total weight less than Ti.jr(W) ■ fc^ and obtain a graph W' that is .F-homomorphism-free. This will 
obviously be a contradiction. 

Assume such a G' exists and apply Lemma [2.61 on it with 7 = q^^^e,k,"^jr[k)) and m = k 
(we use m = A; as G is already partitioned into k subsets Vi, . . . , V^). For the rest of the proof we 
denote by Vi^i, ■ ■ ■ , Vi^i the partition of Vi that Lemma 12.61 returns. Recall that as = ... = |Vfc| 
and Lemma 12.61 partitions a graph into subsets of equal size, then all the sets Vi are partitioned into 
the same number / of subsets. Note also that by Lemma 12.61 and the definition of T in (l2|) we have 
/ < T. Observe, that T is in fact an upper bound for the total number of partition classes Vi^j). 

By Lemma 12.61 (recall that by Comment 12.41 we may assume '^ [2^ ^e, fc, ^!jr[k)) < ^e), we are 
guaranteed that out of the Ik sets Vij at most §('2) pairs are not " fj^ ^e, k, 'I'jr(fc))-regular. We 
define a graph G" , which is obtained from G' by removing all the edges connecting pairs (V^^j/, Vjji) 
that are not q^^^e,k,"^yr(^k))-regulav, and all edges connecting pairs {Vi^ii,Vjj') for which their 
edge density in G' is smaller than ^e. 

Proposition 4.3 There are k sets Vi^n , ■ ■ ■ , ^k^t,, such that the graphs induced by G and G" on these 

2 2 

k sets differ by less than Tly^{W) ■ ^ — ^ edges. 

Proof: We first claim that G" is obtained from G' by removing less than ^n? edges. To see this 
note that the number of edges connecting a pair (V^^j/, Vjj^) is at most {n/kl)'^. As there are at most 
1(^2) pairs that are not " J2^ k^^ ^' ^:F(fc))-regular, we remove at most edges due to such pairs. 
Finally, as due to pairs, whose edge density is at most ^e, we remove at most ( 2') f (ra/A;/)^ < 
edges, the total number of edges removed is at most |n^, as needed. 

As we assume that G' is obtained from G by removing less than Tljr{W) ■ r? — er? edges, we get 
from the previous paragraph that G" is obtained from G be removing less than Tlj^{W) ■ rx^ — 
edges. Suppose for every 1 < i < /c we randomly and uniformly pick one of the sets V^^i, . . . , Vi^i. 
The probability that an edge, which belongs to G and not to G", is spanned by these k sets is l~'^ . 
As G and G" differ by less than Hj^(W) ■ — edges, we get that the expected number of such 

2 2 

edges is less than Hj^iW) ■ ^ — ^ and therefore there must be a choice of k sets that span less than 
this number of such edges. ■ 

We are now ready to arrive at a contradiction by showing that if it is possible to remove less than 
TiyriW) ■ — erP edges from G and thus turn it into an .F-free graph G', then we can remove from 
W a set of edges of total weight less than 7{j^{W) • k^ and thus turn it into an .F-homomorphism-free 
graph W'. Let Vi^jj, . . . , V^^jj. be the k sets satisfying the condition of Proposition 14.31 and obtain 
from W a graph W' by removing from W edge {i,j) if and only if the density of (Vi^n, '^j,t ) in G" 
is 0. 

Proposition 4.4 W is T -homomorphism-free. 

Proof: Assume F' ^ W' for some F' G J-. As W' is a graph of size k this means (recall Definition 
14. 2|) that there is F S of size at most ^ jr{k) such that F ^ W'. Let if he a homomorphism from F 
to W' . By definition of for any {u, v) G E{F) we have i<f{u), if{v)) is an edge of W'. Recall that by 
definition of G" either the density of a pair (V^^j/, Vjj/) in G" is zero, or this density is at least and 
the pair is ^^i^^^e, k,"^ ^{k))-vegulav. By definition of W, this means that for every {u,v) G E{F) 
the pair {V^{u),t^^^) , ) has density at least f in G" and is T2Mh^ ^' ^.F(A;))-regular. By item 
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1 of the lemma we have for sdl 1 < i < k that \Vi\ > A j3^ fc, e). By our choice in (jj]) and the fact 
that / < T, the sets Vi^a must therefore be of size at least 

\^k, e)\/l> |A^(A;, e)\/T = A^^e, k, 

Hence, the sets Vi^n , ■ ■ ■ , Vk,tt, satisfy all the necessary requirements needed in order to apply Lemma 
l2.3l on them in order to deduce that they span a copy of F in G" (recall, that we have already argued 
that |y(-F)| < ^jr(fc))- This, however, is impossible, as we assumed that G' was already ^-free and 
G" is a subgraph of G'. ■ 

Proposition 4.5 For any i < j the edge densities of {Vi,Vj) and (V^^j. , V^-^t^) satisfy in G 

\diV„V,)-d{V,,u,V,,t,)\<le. 

Proof: Recall that l/l > l/T and by §^ we have l/T > q!^k,e). We infer that \Vi^t,\ = \Vi\/l> 
' J^^ k, e)\Vi\. By item 2 of the lemma, each pair {Vi,Vj) is " J^^ k, e)-regular in G. Hence, by 
definition of a regular pair, we must have \d{Vi, Vj) — d{Vi^ti, Vj,tj)\ < ^gT^ fc, e) < ■ 

Proposition 4.6 W' is obtained from W by removing a set of edges of weight less than Tly^{W) • k'^. 

Proof: Let S be the set of edges removed from W and denote by w{S) the total weight of edges in 
S. Let e{Vi^ti, Vj,tj) denote the number of edges connecting the pair (V^^^. , Vj^tj) in G. We claim that 
the following series of inequalities, which imply that w{S) < Ti.y^{W) ■ k'^, hold: 



2 2 

En en 



w{S) 



9 2 

n en 



Indeed, recall that by the definition of W, we have {i,j) G if and only if the density of the 
pair (Vi^i', Vjj') in G" is 0, which means that all the edges connecting this pair were removed in G" . 

1—1—1 2 2 

As by Proposition 14.31 the total difference between G and G" is less than 7ij^{W) ■ j2 — ^ we infer 
that the first (strict) inequality is valid. The second inequality follows from Proposition 14.51 together 
with the fact that by the condition of the lemma we have d{Vi,Vj) = w{i,j). The third inequality is 
due to the fact that W has k vertices and thus \S\ < k'^. ■ 

The sought after contradiction now follows immediately from Propositions 14.41 and 14. 6i This 
completes the proof of the lemma. ■ 

We continue with the proof of Lemma 13.51 
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Proof of Lemma 13. 5t We prove the lemma with: 



J ^g2 ^ ^ Q 

= [ min(l6r-2, ig^, ^r, ^e)), r > 1 

and 

We start with showing that Ejr{G) < TiyriyV) + e. Suppose G is a graph of n vertices, in which 

2 

case the number of edges connecting Vi and Vj is w{i,j) ■ p-. We first remove all the edges within the 

sets Vi, . . . ,Vk. As k > 1/e the total number of edges removed in this step is at most ^("'2'^) < erP. 

Let S be the set of minimal weight whose removal turns W into an ^-homomorphism-free graph 
W. We claim that if for every £ S we remove all the edges connecting Vi and Vj the resulting 
graph G' spans no copy of a graph F £ J^. Suppose to the contrary that G' spans a copy of F S 
and consider the mapping ip : V{F) 1— > {1, . . . ,k} that maps every vertex of F that belongs to Vj to 
j. As we have removed all the edges within the sets Vi, . . . ,Vk and all edges between Vi and Vj for 
any (i, j) £ S we get that (/? is a homomorphism from F to W contradicting our choice of S. Finally, 
note that the number of edges removed in the second step is 

2 

Combined with the first step the total number of edges removed is at most ■ Tlj^{W) + en?, as 
needed. 

For the rest of the proof we focus on proving Ti.jr(W) < Ejr{G) + e. Let A and B be the two 
equipartitions from the statement of the lemma. Suppose for every 1 < i < fc we randomly, uniformly 
and independently pick a set Vj,*. out of the sets Vi^i, . . . , Vi^i. Let P denote the event that (i) All 
the pairs {Vi^tiiVi' ,t^,) are £'(/c) -regular, (ii) All but at most \e{^ of the pairs {Vi^ti^Vi' ,1^,) satisfy 
\d{Vi^ti-,Vii^tii) — d{Vi,Vii)\ < <?(0). We need the following observations: 

Proposition 4.7 P holds with probability at least 1 — ^e. 

Proof: Fix any i < i' . By definition of 43.51 have £{k) < |eA;~^, thus by item 1 of Definition 
12.81 the probability that {Vi^n, ^4',*^;) is not iS(A;)-regular is at most |efc~^. By the union bound, the 
probability that one of the pairs is not <5(A;) -regular is at most (2)^^^^''^ ^ i^- 

Item 2 of Definition 12.81 can be rephrased as stating that there are at most £{0)(^^ = ■^^(-'^{2) 
choices of i < i' for which the probability that \d{Vi^ti, ^i'.t^,) — d{Vi, Vi')\ > £{0) = j^e^ is larger than 
£{0) = jq(^- Thus, the expected number oi i < i' for which \d{Vi^ti,yi' ,t^i) — d{Vi,Vii)\ > <5(0) is at 

most jQ(?i^2) ■ 1 + (2) ■ T&^^ - I^^(2)- Markov's inequality, the probability that more than ^e{'^) 
oi i < i' violate the above inequality is at most |. 

As properties (i) and (ii) of event P each hold with probability at least 1 — |e, we get that P 
holds with probability at least 1 — ^e. ■ 

Proposition 4.8 Assume event P holds and denote by G' the subgraph of G that is spanned by the 
sets Vi,t„. ■ ■ , Vk,t,. Then, Er{G') > n^{W) - \t. 
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Proof: Let W be a weighted complete graph on k vertices satisfying w{i, i') = diVi^n-, yi',t^,)- Event 
P assumes that ah the pairs (V^^j. , Vi'^t., ) are <S(/i;)-regular. As £{k) < " J^^ k, |e) and the lemma 
assumes that \Vij\ > A j3^ /i:) = A jg^ fc, |e) we may deduce from Lemma [37il that 

EAG') > -HAW) - |. (6) 

Now, event P also assumes that all but at most 1(2) of the pairs i < i' are such that V^') — 

Vj/^tj, )| < ^(0) < |. This means that the sum of edge weights of W differs from the sum of 
edge weights of W by at most | (2) due to pairs that violate the above inequality and by at most 
(2)1 due to the other pairs. This means that the sum of edge weights of W differs from that of W 
by at most ffc^ + ^/c^ < ^A;^. This clearly imphes that 

HAW) > HAW) - ^ . (7) 

o 

The proof now follows by combining ([6]) and d?]). ■ 

Let R be an arbitrary set of edges whose removal from G turns it into an .F-free graph. Randomly 
and uniformly select a set Vi^a from each of the sets Vi^i, . . . , Vi^i, and let R' denote the set of edges 
of R that are spanned by these k sets. We claim that the following upper and lower bound on the 
expected size of R' hold: 



^■\R\ = E[\R'\ 



> E[\R'\ I P] ■ Prob[P] 

> {l-'-)-E[\R'\\P] 



> inAw)-e) 



in? 



n 



Indeed, the equality is due to the fact than an edge of R has probability precisely l/l"^ to be in 
R' . The second inequality is due to Proposition 14.71 the third is due to Proposition 14.81 and the last 
is valid because TLjriW) < 1. As we thus infer that \R\ > TCjr{W) ■ -n? — ev? for arbitrary R, we get 
that Ejr[G) > 7ij^{W) — e, thus completing the proof. ■ 

5 Proofs of Algorithmic Results 

The technical lemmas proved in the previous sections enabled us to infer that certain f-regular 
partitions may be very useful for approximating E-p. In this section we apply Proposition 12. H1 in 
order to efficiently obtain these partitions. We first prove Theorem ll.il while overlooking some subtle 
issues. We then discuss them is detail. 

Proof of Theorem II. It Fix any e > and monotone graph property V. Let = T-p be the family 
of forbidden subgraphs of V as in Definition 13.11 As satisfying V is equivalent to being .F-free, we 
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focus on approximating Ejr{G). Let S^^ r) and A ^^ r) be the appropriate function with respect 
to and e. Put S{e) = S^^ l/e, 43.51 ) ^'^'^ recall that by Proposition 12. 101 the integer S can indeed 
be upper bounded by a function of e. 

If an input graph has less than 5(e) • A^^S'(e)) vertices we use exhaustive search in order 
to precisely compute Ejr{G). Assume then that G has more than 5(e) • A^^5(e)) vertices, and 
use Proposition 12.111 with m = 1/e and S^^ r) as above in order to compute the equipartition 
"4. = {Vi I 1 < i < A:} and its refinement B = {Vij | 1 < i < fc, 1 < J < satisfying the conditions 
of Lemma 12.91 As m is bounded by a function of e we get from Proposition 12 . 1 11 that this step takes 
time O(n^). Also, by Lemma [2.91 we have kl < 5, therefore, as G has at least 5(e) • A^3](5(e)) 
vertices each of the sets V^j is of size at least A j33] (5(e)) > A j33] (fc). Let be a weighted complete 
graph of size k where w{i,j) = d{Vi, Vj). Using exhaustive search, we can now precisely compute the 
value of njr{W). By Lemma[33]we may infer that |^^(G) - W^(VF)| < e. ■ 

As we have mentioned in the introduction, one should specify how the property V is given to the 
algorithm. For example, V may be an undecidable property, in which case we cannot do anything. We 
thus focus on decidable graph properties. However, even in this case we may face some unexpected 
problems. Note, that for a general infinite family of graphs it is not clear how to compute Tiy^ 
in finite time. Also, returning to the overview of the proof of Lemma 12.91 given in Section [21 note 
that we have implicitly assumed that one can compute the function £, as this is needed in order to 
compute the parameters with which one applies Lemma I2.1U[ A close inspection of the proofs of 
Lemmas 13.41 and 13.51 reveals that computing £ involves computing the function (see ^ , ([3]) and 
One of the main results of |5| asserts that somewhat surprisingly, there is a family of graph 
properties for which the property of being ^-free is decidable (in fact, in coNP) but at the same 
time is not computable. Therefore, even if we confine ourselves to decidable graph properties we 
still run into trouble. 

Suppose first that e is not part of the input to the algorithm. As we have discussed in Section [21 
in this case all the applications of £ [3 g| are on inputs of size depending on e only, thus the algorithm 
may "keep" the answers to these (finitely many) applications of <^3] as part of its description. 
Similarly, in this case we may need to compute Tlj^ on graphs of size depending on e onlj|2|, thus 
the algorithm may "keep" the answers to these (finitely many) applications of TCjr as part of its 
description. Observe, that we don't need to keep the answer of Tlj^ for all the (infinite) range of edge 
weights. Rather, as we only need to approximate Ejr within an additive error of e, it is enough to 
consider edge weights {0, e, 2e, 3e, . . . , 1}. 

If we want the algorithm to be able to accept e as part of the input, then we must confine 
ourselves to properties for which ^'jp is computable. However, as for any reasonable graph property 
this function is computable, this is not a real constraint. For example, as we have mentioned in 
Section [H if V is the property of being bipartite, then ^jr(^k) is either k or k — 1. Another natural 
family of properties for which ^'^^(fc) is computable is that of being H-fiee for a fixed graph H, as 
in this case ^y^{k) < \V{H)\. By the definition of the function <^3] we get that if is computable 
then so is <^3} It is also not difficult to see that if ^'jf is computable then so is TCy^. Therefore, in 
case is computable, there is no problem with accepting e as part of the input. 

We now turn to prove Theorem 11.21 We note that the above difficulties are also relevant for 
Corollarv ll.2[ which applies Theorem II. 2[ but we refrain from discussing them again. 

Proof of Theorem II. 2t (sketch) As in the previous proof, we focus on the property of being J^- 

^Recall that the size of the graph on which we compute Ti,jr is the number of partition classes of the S-regular 
partition, and this number is at most S^^m,£), which is bounded by a function of e. 
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free, where T is the family of forbidden subgraphs of 7^. Suppose, as in the previous proof, that G is a 
large enough graph (in terms of e) as otherwise we can take D to be the entire vertex set of G. Assume, 
we implicitly apply Lemma 12.91 on G and let A = {Vi | 1 < i < k}, B = {Vi^ | 1 < i < A;, 1 < j < ^} 
be the equipartitions returned by the lemma. Let W he a weighted complete graph on k vertices, 
where w{i,j) = d{Vi,Vj). By Lemma [331 we have 

\E^{G)-nAW)\<e. (8) 

Let I? be a random set of vertices and for 1 < i < k let Ui denote the vertices of D that belong 
to Vi, and for 1 < i < k,l < j < I let Uij denote the vertices of D that belong to Vij. Recall 
that k and / are bounded by functions of e. Using standard Chernoff Bounds (see, e.g., [8]), it is 
easy to see that if we use a large enough sample of vertices D (but only large enough in terms of 
e), then with high probability (whp) we will have \d{Vi,Vi') — d{Ui,Ui')\ < e for any i < i' and 
\d{Vij, Vi'j') — d{Uij, Ui'j')\ < e for any i < i' and j ^ j' . Therefore, if W is a weighted complete 
graph on k vertices, where w(i,j) = d{Ui, Uj) then 

\nAw)-nAw')\<e. (9) 

Furthermore, using Chernoff bounds again, one can show that whp all the pairs {Ui,Ui') and 
{Uij,Uiij') are as regular as {Vi,Vii) and {Vij,Vi'ji) (up to e). Therefore, the graph induced by D, 
denoted G' , will have equipartitions A',B' satisfying the requirements of Lemma 12.91 This means 
that 

\E^(,G')-nAW')\<e. (10) 

As ([I]), ([9]) and (llOp all hold with high probability for any e > 0, we can thus make sure that with 
probability at least 1 — e, we will have |-Ejr(G') — £'jf(G)| < e. This completes the proof. ■ 

6 Overview of the Proof of Theorem 11.31 

For the proof of Theorem 11.31 it will be more convenient to denote by E'p{G) the number of edge 
removals needed to make G satisfy V, in other words E'.p{G) = v? ■ E'p{G). In particular, E'^iG) 
denotes the number of edge removals needed to turn G into an H-iiee graph. We will also denote 
by E'j.{G) the number of edge removals needed to turn G into an r-partite graph (or equivalently 
r-colorable graph). Note, that approximating E'j,{G) within ii?'^ is equivalent to approximating 
E-p{G) within n'^ . 

The main technical result we need in order to obtain Theorem 1 1.31 is an extension of some classical 
results in Extremal Graph Theory. Recall, that Turan's Theorem (see [l5]) states that the largest 
Kr+i-iiee graph on n vertices {K^+i = complete graph on r + 1 vertices) is precisely the largest 
r-partite graph on n vertices. Another classical result is the Erdos-Stone-Simonovits Theorem (see 
[45j), which states that for any graph H of chromatic number r + 1, the largest H-fvee graph on n 
vertices has at most o(n^) more edges than the largest r-partite graph on n vertices. As any r-partite 
graph does not contain a copy of a graph of chromatic number r -|- 1, the above results can thus be 
restated as saying that when H = K^+i we have E'^{Kn) = E'^[Kn) and that for any H of chromatic 
number r -M we have E'^{Kn) - o{n^) < E'^{Kn) < E'^{Kn). 

The main extremal graph-theoretic tool that we use in order to obtain Theorem ll.3l is the following 
result, which greatly extends one of the main results of [l^ . Note, that this result also extends Turan's 
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Theorem and the Erdos-Stone-Simonovits Theorem as it states that E'^{G) and E'^[G) are very close 
not only when G is Kn but already when G has a sufficiently large minimal degree. 

Theorem 6.1 Let H he a graph of chromatic number r + 1 > 3. 

(i) If there is an edge of H whose removal reduces its chromatic number, then there is constant 
fj, = fJ,{H) > such that if G = {V, E) is a graph on n vertices of minimum degree at least 
(1 - ^l)n, then E'^iG) = E',{G). 

(a) Otherwise, there are constants 7 = 'y{H) > and fj, = ^i{H) > such that if G = iV^E) is a 
graph on n vertices of minimum degree at least (1 — /u)n, then 

E',{G) - 0{n^-^) < E'h{G) < E'^{G). 

The assertion of this theorem for the special case of H being a triangle is proved in [T3] and in 
a stronger form in [15]. We note that the n^~^ term in the second item of the theorem cannot be 
avoided. Note, that the error term we obtain in the second part of the theorem is better than the 
error term of the classical Erdos-Stone-Simonovits Theorem. Such improvement of the error term 
was previously known (see, e.g., [20] and [l3]) but only for the case of G being Kn and not for G of 
sufficiently high minimal degree. The proof of Theorem 16.11 appears in Section [71 

Our second tool in the proof of Theorem ll.3l is certain pseudo-random graphs. An (n, d, \)-graph 
is a d-regular graph on n vertices all of whose eigenvalues, except the first one, are at most A in 
their absolute values. This notation was introduced by the first author in the 80s, motivated by 
the fact that if A is much smaller than d, then such graphs have strong pseudo-random properties. 
In particular, (see, e.g., Chapter 9), in this case the number of edges between any two sets of 
vertices U and of G is roughly its expected value, which is |VF|(i/n, (see Section [8] for the 
precise statement). There are many known explicit constructions of (n, d, A)-graphs that suffice for 
our purpose here. Specifically, we can use, for example, the graph constructed by Delsarte and 
Goethals and by Turyn (see [37]). In this graph the vertex set V{G) consist of all elements of the 
two dimensional vector space over GF{q) {q is any prime power), so G has n = vertices. To define 
the edges of G we fix a set L oi k lines through the origin. Two vertices x and y of the graph G are 
adjacent if x — y is parallel to a line in L. It is easy to check that this graph is d = k{q — l)-regular. 
Moreover, because it is a strongly regular graph, one can compute its eigenvalues precisely and show 

2 

that besides the first one they all are either —k or q — k. Therefore, by choosing k = {1 — /^)^^ 
we obtain an (n, d, A)-graph with d = (1 — /x)n and A < ^/n (fi will be chosen as the constant from 
Theorem 16. ip . 

Given a graph F let Fh denote the 6-blowup of F, that is, the graph obtained from F by replacing 
every vertex v E V{F) with an independent set Iv, of size b, and by replacing every edge {u, v) £ E{F), 
with a complete bipartite graph, whose partition classes are the independent sets and /„. It is not 
difficult to show (see Claim [8^ that for any integer r, we have E'j.{Fh) = h'^E'^{F). The final piece 
of notation we need is the Boolean Or, denoted by Gi U G2 of two graphs Gi and G2 on the same 
set of vertices V . Its set of vertices is V , and its set of edges contains all edges of Gi and all edges 
of G2. 

Armed with these preparations, we can now outline the proof of Theorem 11.31 Its first part is 
an easy application of Turan's Theorem for bipartite graphs. The proof of the second part is more 
interesting. Suppose all bipartite graphs satisfy V, and let r -|- 1 ( > 3) be the minimum chromatic 
number of a graph that does not satisfy this property. Fix a graph H of chromatic number r + 1 
that does not satisfy V and let /i be the constant of Theorem 16.11 Consider, first, the case r > 3. 
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In this case we show that any efficient algorithm that approximates E!p[G) up to v?~^ wiU enable 
us to decide efficiently if a given input graph F = {V{F), E{F)) is r-colorable. Indeed, given such 
an F on m vertices, let b = mP where c is large constant, to be chosen appropriately. Let -F^ be the 
6-blowup of F, and let F' be the vertex disjoint union of r copies of F^. Let G' be the (n, d, A)-graph 
with d = {\ — fi)n and A < ^/n, whose number of vertices n, is at least the number of vertices of 
F', and not more than four times of that, and identify the vertices of F' with some of those of G'. 
Let G = G' U F' he the Boolean Or of these two graphs. If F is r-colorable, then so is its blowup 
Ff,, and hence in this case F' has a proper r-coloring in which all color classes have the same size. 
This can be extended to a partition of the vertices of G to r nearly equal color classes, providing 
an r-colorable subgraph of G (which satisfies V by our choice of r) that contains all edges of F', 
and some edges of G' that do not belong to F'. The pseudo-random properties of G' enable us to 
approximate this number well. 

On the other hand, if F is not r-colorable, then any r-colorable subgraph of G misses at least 6^r 
edges of F' , and, by the pseudo-random properties of G' cannot contain too many edges of this graph 
that do not belong to F' . With the right choice of c, this will ensure that if we can approximate 
the number of edges in a maximum r-colorable subgraph of G up to an n^~''-additive error, this will 
enable us to know for sure whether F is r-colorable or not. However, by Theorem 16. H and as the 
minimum degree of our graph is at least (1 — fi)n, the maximum size of an H-fiee subgraph of G is 
very close to the maximum size of an r-colorable subgraph of it, which is therefore also very close 
to the maximum number of edges in a subgraph of G satisfying V. This implies that approximating 
well this last quantity is A^P-hard. The case r = 2 is similar, but here we have to use that the 
MAX-CUT problem is A^P-hard. The full details appear in Section [HI 

7 Proof of Theorem 16.11 

Throughout this section we will assume that the number of vertices n in our graph is sufficiently 
large. We first prove the first part of Theorem 16.11 which is an extension of Turan's theorem. To 
this end, we need a result proved for i^T^+i-free graphs by Andrasfai, Erdos and Sos [9j and in a more 
general form by Erdos and Simonovits [21j . 

Theorem 7.1 ([9j,[21j) Let H he a fixed graph with chromatic number r + 1 > 3 which contains an 
edge e such that x{H — e) = r . If G is an H-free graph of order n with minimal degree S{G) > g^~^ n 
then G is r-colorable. 

We will also need the following simple lemma. 

Lemma 7.1 Let r > 2 be an integer and suppose G' is an r-partite subgraph of a graph G (which 
may he empty) such that there are m edges incident to the vertices in V{G)\V{G'). Then G has an 
r-partite subgraph of size at least e{G') + ^-^m. 

Proof: Let {A\, . . . ,A'^) be the partition of G' . Consider an r-partite subgraph T of G with parts 
such that A'^ C Ai for every z, where we place each vertex v G V{G)\y{G') in Ai 
randomly and independently with probability 1/r. All edges of G' are edges of P, and each edge 
incident to a vertex in V {G)\V {G') appears in T with probability ^—r-. By linearity of expectation 
E[e(r)] = e(G') + ^-^m, so some r-partite subgraph of G has at least this many edges. ■ 

In particular, by taking G' to be the empty graph we obtain that every G contains an r-partite 
subgraph of size at least ^^e(G). 
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Proof of Theorem 16.11 part (i): We prove that E'^{G) = E'^{G) for all graphs G on n vertices 
with minimum degree 

5iG) > 1 1 - r ) n + l. 

^ ' ~ \ 4(r-l)(3r-l)y 

Let r be the largest (in terms of number of edges) r-partite subgraph of G and let F be the largest 
H-hee subgraph of G. To prove the first part of the theorem one needs to show that e{F) = e{T). 
As H is not r-colorable we trivially have e(-F) > e(r). In the rest of the proof we establish that 
e(r) > e{F). First, note that by Lemma l7. II we have 

e(r) > ^e(G) 

= ^ffl-^ S An + l]n/2 

r \\ 4(r - l)(3r - 1)/ J ' 

Vlr^ - 16r + 1 2 r-\ 

-n H n. 



8r(3r - 1) 2r 

If F has a vertex of degree at most ffEj ^ we delete it and continue. We construct a sequence of 
graphs F = Fn, Fn-i, where if F^ has a vertex of degree < |frf ^ we delete that vertex to obtain 
Fk-i- Let F' be the final graph of this sequence which has s vertices and minimal degree greater 
than g^~^ s. Since F' is H-fiee, by Theorem 17. 1| it is r-partite. Therefore we have that 

> 12r^-16r + 1 ^2 3r - 4 , , 
8r(3r-l) 2(3r - 1) ^ ^' 

2 2 

This implies that 2^(3^-1) - 8r(3r-i) so s > n/2. 

Let X be the set of n — s vertices which we deleted, i.e., X = V{G) — V{F'). By the minimal 
degree assumption there are at least 

^r.^M^i 12r2-16r + l , . , , (n - s)^ 

edges incident with vertices in X. Thus, by Lemma EU the size of the largest r-partite subgraph of 
G is at least 

/^/N r-l 3r-4//n+l\ fs + lW r-1 

e(r) > eiF') + --m>eiF)-^^(^(^ 2 )"( 2 JJ+^"^ 

3r — 4 2 2\ 3r — 4 r — 1 

= e{F) -In —s -(n — s H m 

^ ' 2(3r-l)^ ' 2(3r-l)^ ' r 

3r-4 , 2 2x r-1 / 12r2-16r+l , , [n - sf 

> e(F) -(n^-s^) + — Mn-s)-- — 

- ^ ^ 2(3r-l)^ ^ r V4(?' - l)(3r - 1) ^ ^ 2 

This implies that e(r) > e{F) and completes the proof. 
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We turn to prove Theorem 16.11 part (ii). To this end, we first prove the main technical result of 
this section, Theorem 17.21 below, which is a version of Theorem 17.11 that applies to arbitrary graphs 
H. We then apply this theorem in order to prove Theorem 16.11 part (ii). The reader may want to 
note that this application of Theorem 17.21 is similar to the way we applied Theorem 17.11 in order to 
prove the first part of Theorem 16. li 

Theorem 7.2 Let H be a fixed graph on h vertices with chromatic number r + 1 > 3 and let G be an 
H-free graph of order n with minimum degree d{G) > (^-7^ — 3/^) n. Then one can delete at most 
0(n^~*^''"*'^)/'^) edges to make G r-colorable. 

Proof: First we need the following weaker bound on E'^{G). 
Claim 7.2 G can be made r -partite by deleting o{n^) edges. 

Proof: We use the Regularity Lemma given in Lemma 12.61 For every constant < rj < let 
7 = ^[2]3](^) r + 1, /i) <rf' be sufficiently small to guarantee that the assertion of Lemma 12.31 holdj^. 
Consider a 7-regular partition [Ui, U2, ■ ■ ■ Uk) of G. Let G' be a new graph on the vertices 1 <i <k 
in which {i,j) is an edge iff (C/j, Uj) is a 7-regular pair with density at least r/. Since G is an H-fvee 
graph and H is homomorphic to Kr+i (as xi^) = r + l), by Lemma [2.31 G' contains no clique of size 
r + 1. Call a vertex of G' good if there are at most rjk other vertices j such that the pair (Ui, Uj) is 
not 7-regular, otherwise call it bad. Since the number of non-regular pairs is at most 

7(2) < 

we have that all but at most rjk vertices are good. By definition, the degree of each good vertex in 
G' is at least (^-7^ — 3^;^) ^ ~ 2r//c — 1, since deletion of the edges from non-regular pairs and sparse 
pairs can decrease the degree by at most r]k each and the deletion of edges inside the sets Ui can 
decrease it by 1. By deleting all bad vertices we obtain a Kr+i-iree graph on at most k vertices with 
minimal degree at least 

/r-1 ^\k3r]kl>(^~^ ^ k 

\ r 3hr'^ J ~ \ r 3/ir^y 

~ ( r 3r'^^ 
3r - 4, 



Therefore, by Theorem 17. H this graph is r-partite. This implies that to make G r-partite we can 
delete at most 'yn'^ + rjn? -\- (rjn) ■ n + k ■ {n/k)'^ < 3rjn^ -\- /k = o{v?) edges. ■ 

Consider a partition (Vi, . . . , Vr) of the vertices of G into r parts which maximizes the number 
of crossing edges between the parts. Then for every x G Vi and j ^ i the number of neighbors of x 
in Vi is at most the number of its neighbors in Vj, as otherwise by shifting x to Vj we increase the 
number of crossing edges. By Claim [7^21 we have that this partition satisfies that ~ o(n^). 

Call a vertex x of G typical ii x Vi and has at most n/(10/ir^) neighbors in Vi. Note that there 
are at most o(n) non-typical vertices in G and, in particular, every part Vi contains a typical vertex. 
By definition, the degree of this vertex outside Vi is at least (^^ — n — ^^^i > (^-7^ — ^ 
and at most n — \ Vi\. Therefore \Vi\ < (^ + jKr^)''^- ^Iso note that the number of neighbors in Vi of 

^Recall that by Comment I2.4l we may assume that f^^ y, r + l,h) < y'^ . 
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every typical vertex x £ Vj,j ^ i is at least 



dvi{x) > d{x) - dvj{x) - (r - 2) max IV^I 

~ \ r 3hr'^ J \ r 2/ir^ 

1 r-V 

> |--^r^|n. (11) 



r 2/ir2 

The next claim is an immediate corollary of the above observation. 

Claim 7.3 Let U be a subset of Vj of size at least — ■^)n and let yi, . . . ,yk be an arbitrary set 

:es outi 

adjacent to all vertices yi. 



of k < r — 1 typical vertices outside Vj . Then, there are at least 2r{r+i) vertices in U , which are 



Proof: By definition, there are at most \Vj \ —dy.{yi) non- neighbors of yi in Vj and thus there are at 
most that many vertices in U not adjacent to yi. Delete from U any vertex, which is not a neighbor 
of either yi,y2, ■ ■ ■ ,yk- The remaining set is adjacent to every vertex y^ and has size at least 

i 

Since by pT|) the degree in Vj of every typical vertex y^ Vj is at least dv^iyi) > — f;^)^) we 
obtain that the number of common neighbors of yi , . . . , y/; in [/ is at least 

\u\-Y.i\v,\-dy^{y.)) > f^(l-'^)^-m + \u\ 







n — 








fl 




\2r 


4/ir J ' 




n n 


n > 





> \U\ ; n > I ; n ; U 

- ' ' 2hr - \2r Ahr J 2hr 

,1 /c + l\ n n n 
> ^ n > > 



2r 2hr J ~ 2r 2h ~ 2r(r + 1) 



Here we used that k + 1 < r and h > r + 1. 



Claim 7.4 For every non-typical vertex x G Vi there are at least ^f^^^^iy r -cliques yi, . . . ,yr such 
that yj £ Vj for all 1 < j < r and all vertices yj are adjacent to x. 

Proof: Without loss of generality let i = 1 and let x G 14 be a non-typical vertex. Since for every 
J 7^ 1 the number of neighbors of x in Vj is at least as large as the number of its neighbors in Vi we 
have that 

dy^jx) + dy^jx) 1 / /r-1 1 \ 
dy^{x) > >-^---^jn-(r-2)max|y, 

> Ur-^-^)r^-(r-2)(-+-^]n 



2 \\ r Vr 2/ir^ 

> (—- — ^ n. (12) 
- \2r Ahr J ^ ^ 
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To construct the r-cliques satisfying the assertion of the claim, first observe, that since x is 
non-typical it has at least n/(10/ir^) neighbors in Vi and at least n/(10/ir^) — o(n) > n/(15/ir^) of 
these neighbors are typical. Choose yi to be an arbitrary typical neighbor of x in Vi and continue. 
Suppose at step l<A;<r — Iwe already have a fe-clique yi, . . . ,yk such that yi £ Vi for all i and all 
vertices yi are adjacent to x. Let Uk+i be the set of neighbors of x in Vk+i- Then, by (fT2]) we have 
that = (ivfe+i(2;) > — and therefore by Claim [7131 there are at least 2r(r+i) common 

neighbors of the vertices yi in Uk+i- Moreover, at least 2r{r+i) ~ ^i''^) > sTS' of them are typical and 
we can choose yk+i to be any of them. Therefore at the end of the process we indeed obtained at 
least j^i^Y'^ = 5^(3^a)r r-chques with the desired property. ■ 

Claim 7.5 Each Vi contains at most 0(1) non-typical vertices. 

Proof: Suppose that the number of non-typical vertices in Vi is at least bh?{2>r'^Y . Consider an 
auxiliary bipartite graph F with parts Wi, W2, where Wi is the set of some t = 5/i^(3r^)^' non-typical 
vertices in Vi, W2 is the family of all r-element multi-sets of V{G) such that x G Wi is adjacent 
to multi-set Y from W2 iff Y is an r-clique in G with exactly one vertex in every Vj and all vertices 
of Y are adjacent to x. By the previous claim, F has at least e{F) > t ^j^l^^-iy = hn^ edges and 
therefore the average degree of a vertex in W2 is at least dav = ^{F) /\W2\ = e{F)/n'^ > h. By the 
convexity of the function f{z) = we can find h vertices xi, . . . ,Xh in Wi such that the number 
of their common neighbors in W2 is at least 



Thus we proved that G contains h vertices X = {xi, . . . ,Xh} and a family of r-cliques C of size 
m = Qi^n"^) such that every clique in C is adjacent to all vertices in X. Next we need the following 
well-known lemma which appears first implicitly in Erdos |19] (see also, e.g., |27j). It states that if 
an r-uniform hypergraph on n vertices has m = Q{n^^ edges, then it contains a complete r-partite 
r-uniform hypergraph with parts of size h. By applying this statement to C, we conclude that there 
are r disjoint set of vertices Ai, . . . , each of size h such that every r-tuple ai, . . . , with G Ai 
forms a clique which is adjacent to all vertices in X. The restriction oi G to X, Ai, . . . , Ar forms a 
complete (r + l)-partite graph with parts of size h each, which clearly contains H. This contradiction 
shows that there are less than 5/i^(3r^)'' = 0(1) non-typical vertices in Vi and completes the proof 
of the claim. ■ 



Having finished all the necessary preparations, we are now ready to complete the proof of Theorem 
17. 2[ Let hi < h2 < . . . < /i-r+i be the sizes of the color-classes in an r -|- 1 coloring of H. Clearly 
hi < h/[r + 1). Without loss of generality, suppose that Vi spans at least 2/in2-(^+i)/'^ edges. By 
the previous claim, only at most 0{n) of these edges are incident to non-typical vertices. Therefore 
the set of typical vertices in Vi spans at least hn'^~^'^~^^^^'^ edges. Then, by the well known result 
of Kovari, Sos and Turan [36] about Turan numbers of bipartite graphs, Vi contains a complete 
bipartite graph Hi = Kh^^h2 all of whose vertices are typical. If there are at least /13 typical vertices 
in V2 which are adjacent to all vertices of Hi then we add them to Hi to form a complete 3-partite 
graph H2 with parts of sizes hi, h2 and h^ and continue. We claim that if at step 1 < A; < r — 1 there 
is a A: + 1-partite graph Hf^ C ^^=iVi with parts of sizes hi, ... , h^+i all of whose vertices are typical, 
then we can extend it to the complete k + 2-partite graph Hk+i by adding hk+2 typical vertices from 
Vk+i which are adjacent to all vertices of H}^. Indeed, recall that by (jlip the number of neighbors 
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in Vk+i of every typical vertex x G V^, i / A; + 1 is at least (ivi,+i(a^) > — f^)'^- Let t < h he the 
order of Hk- Then, as in Claim [731 the number of vertices in Vk+i which are adjacent to all vertices 
of Hk is at least 



r 



n tir-l)+t-l 
= ; — 7^ n 

^ n t ^ n n n 
~ r 2hr ~ r 2r 2r 

and thus at least n/(2r) — 0(1) > /ifc+2 of these vertices are typical. Continuing the above process 
r — 1 steps we obtain a complete (r + l)-partite graph with parts of sizes hi, ... , /ir+i, which clearly 
contains H. This contradicts our assumption that G is H-fiee and shows that every Vi spans at 
most 0(n^~('"+^)/'*) edges. Therefore the number of edges we need to delete to make G r-partite is 
bounded by '^ie{Vi) < 0[n'^~^'^^^^^^) . This completes the proof. ■ 

Proof of Theorem 16.11 part (ii): Let H he a fixed graph on h vertices with chromatic number 
r + 1 > 3. We show that the constants ^{H) and n{H) in the assertion of the theorem can be 
chosen to be (r + and l/(4/ir'^) respectively. Let G he an H-iiee graph of order n with minimal 
degree 5{G) > (1 — and let T he the largest r-partite subgraph of G and F be a largest H-fiee 

subgraph of G. To prove the second item of the theorem it is enough to show that e(r) < e{F) < 
e(r) + 0(n^^(^'+^)/''). As H is not r-colorable we trivially have e(r) < e{F). In the rest of the proof 
we establish that e{F) < e{T) + ©(n^-^^'+i)/^). By Lemma Owe have that 



/ N r-1 r-\( 1 \ 9, (r-\ r -\\ ^ 

If F has a vertex of degree at most — ^^)n we delete it and continue. We construct a sequence 
of graphs F = Fn, F^-i, where if F^ has a vertex of degree < — ^^)k we delete that vertex 
to obtain F^-i. Let F' be the final graph of this sequence which has s vertices and minimal degree 
greater than {'^-^ — ^^)s and let T' he the largest r-partite subgraph of F'. Since F' is H-fiee, 
Theorem O implies e(F') < e(r') + 0(n2-(^+i)/'»). Therefore we have that 

1 2 /2^ - /^/N. f r — 1 1 \ f f n + l\ f s + 1 



-s^ + o{n^) > e{F')>e{F) 



2r ^ ^ - ^ ^ - ^ ^ \^ ^ J yy 2 J \ 2 

- \ 2r 8/ir3 J \ 2r 6hr^ J ^ i \ i 

This implies that 

( J_ _ r-l \ 2 _ . 2^ (J: \_\ 2 = 

6/ir2 - V6/ir2 Shr^ ) o[n ) > y^^^^ ^^^^ jn - ^^^^^ 

and so s > n/2. 
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Let X be the set of n — s vertices which we deleted, i.e., X = V{G) — V{F'). By the minimal 
degree assumption there are at least 



m > 6{G)\X 




edges incident with vertices in X. Thus, by Lemma |7. 11 the size of the largest r-partite subgraph of 
G is at least 



e(r) > e{r') + - — -m>e{F') -0{n'^^'-''+^^/'') + - — -m 



- ^ ' \ 2r 6hr^ J ^ ^ ^ ^ \\ 2r 4hr^ J 2r J ^ ^ 



8 Proof of Theorem D 

We start with the proof of the first part of Theorem II. 3i If there is a bipartite graph H that does 
not satisfy V, then, by the known results about the Turan numbers of bipartite graphs proved in 
[36j . there exists a positive 5 > such that for any large n, any graph with n vertices and at least 
n'^~^ edges contains a copy of H. Thus, given a graph G on n vertices, one must delete all its edges 
besides, possibly, n'^~^ of them, to obtain a subgraph satisfying V. As certainly the edgeless graph 
satisfies V, this provides the required approximation in this case. 

The proof of the second part is more complicated, and requires all the preparations obtained 
in the previous section. Suppose all bipartite graphs satisfy V, and let r + 1 > 3 be the minimum 
chromatic number of a graph that does not satisfy this property. Fix a graph H of chromatic number 
r + 1 that does not satisfy V. We will show that any efficient algorithm that approximates Elp{G) 
up to will enable us to decide efficiently how many edges we need to delete from a given input 
graph F = {V{F), E{F)) to make it r-partite. For r > 3 this problem contains the r-colorability 
problem, and for r = 2 it is the MAX-CUT problem and therefore it is A^P-hard for every r > 2. 

Given a graph F on m vertices such that we need to delete i edges to make it r-partite, let 
b = where c is a large constant, to be chosen later. Let F^ be the 5-blowup of F, and let F' be 
the vertex disjoint union of r copies of Fj,. Let n = n{H) be the constant from Theorem 16.11 and let 
G' be the (n, d, A)-graph with d = (1 — fj.)n and A < ^/n, described in Section [6l As the integer q in 
the construction discussed in Section [H] can be a prime power, we can always choose the number of 
vertices of G' , which is q^, to be at least the number of vertices of F', and not more than 4 times of 
that. In particular, we have n = @{rmb) = Identify the vertices of F' with some of those 

of G'. Let G = G' U F' he the Boolean Or of these two graphs. 

Suppose, that instead of adding to F' a pseudo-random graph G', we would put any non-edge 
of F' in G with probability 1 — /x. It is easy to see that in this case the expected number of edges, 
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which would be spanned by a set of a vertices that span t edges in F', would be (1 - + ^t. 

The following claim establishes that this is approximately what we find when we add to F' a pseudo- 
random graph. We then use this claim to show that we can also estimate E'^[G) as a function of 
i = El{F). 

Claim 8.1 Let A he a subset of the vertices of G of size a which contains precisely t edges of F' . 
Then the number of edges of G in A satisfies 

2 2 

(l-/i)y + ^f-0(mV/2) <eG{A) < (l-^)^ + /,t + 0(mV/2). 

Proof: By construction, the edges of the subgraph of F' induced on the set A form an edge disjoint 
union of complete bipartite graphs we denote by Fj = (C/j, Wj), 1 < i < k. Thus = t and 

the fact that F' is a blowup of r disjoint copies of F, which altogether have rm vertices and at most 
r(^) edges, implies that k < ?'(^) < rm?. The number of edges of G spanned on A is the number of 
edges of G' inside A, minus the number of edges of G' spanned by the pairs (C/j, Wi), plus the number 
of edges of F' inside A. To estimate this quantity, we need the well-known fact (see, e.g. Chapter 9 
of that the number of edges between two subsets X, Y of an (n, d, A)-graph G' satisfies 

and the fact that in such a graph |e(X) — I < A|X|. Therefore we obtain that 



e{X,Y) 



\X\\Y\d 



n 



eciA) = eciA) -Y,eG'{U^,W,)+t = eciA) + Y,[\Uim\-eG'{U„W,[ 

i=l 1=1 



> 



2n 



i=l 
2 ^ 

= (l-/i)y + -(A: + l)An 

1=1 

= (1 _ +^t_0 (^2^3/2) _ 



The upper bound eciA) < (1 — /^)^ + fit + 0{m?n^/'^^ can be obtained similarly. ■ 

Recall that the 6-blowup Fi, of a graph defined in Section [6l is the graph obtained from F by 
replacing every vertex v G V{F) with an independent set ly, of size 6, and by replacing every edge 
{u,v) G E{F), with a complete bipartite graph, whose partition classes are the independent sets lu 
and /„. 

Claim 8.2 For any graph F and any integer b, we have E'j.(Fh) = h'^El{F). 
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Proof: We start by showing that E'j.{Fb) < l?E'^{F). Suppose S is a set of E'^[F) edges whose 
removal turns F into an r-colorable graph F' . Suppose we remove from Fi, all the edges connecting 
lu and It, for any {u,v) G S. Note, that we thus remove b'^El{F) edges from F^,. We claim that the 
resulting graph F^ is r-colorable. Indeed, let c : V{F) ^ {1, . . . be a r-coloring of F' and note 
that by definition of F^, if we color all the vertices of ly with the color c{v), we get a legal r-coloring 
of F'. Therefore < h'^E'^{F). 

To see that E'^{Ff)) > b'^E'^{F), let S* be a set of edges whose removal turns Fi, into an r-colorable 
graph, and suppose for every v £ V{F) we randomly pick a single vertex from each of the sets ly. For 
every edge of S, the probability that we picked both of its endpoints is therefore the expected 
number of edges spanned by these vertices is 15*1/6^. As the removal of the edges of S makes Fh 
r-colorable, this in particular applies to all of its subgraphs. Note, that for any choice of a single 
vertex from each of the independent sets ly, the graph they span is isomorphic to F. Thus, any such 
choice spans at least E'^{F) of the edges of S. It thus must be the case that \S\/lP' > and 
the proof is complete. ■ 



Claim 8.3 The graph G satisfies 

' < ©(m^n^). (13) 



Proof: Fix a partition of F into r parts which misses exactly (. edges and consider r disjoint copies 
of F. By taking appropriately difi'erent parts in every copy of F we can partition this new graph 
into r equal parts such that exactly ri edges are non-crossing. Since F' is a 6-blowup of r disjoint 
copies of F, this gives a partition of F' into equal parts which misses rilP' edges. We can extend this 
to a partition of G into r nearly equal sets V{G) = Vi U . . . U which misses exactly rlb"^ edges of 
F' . Let ti be the number of edges of F' inside V^, then = rib'^. This, together with Claim [ 
implies that it is enough to delete at most 



i=l i=l 



2 

< (l-^)rM±ll! + /.^t, + 0(mV/2) 



{l-^JL)— + iiHh^ + Ofm^n^/^) . 
2r ' 



edges to make G r-partite and hence to satisfy property V. 

On the other hand, by Claim W?2\ any partition of F', which is 6-blowup of r disjoint copies of F, 
into r parts misses at least r£6^ edges. Therefore for every partition of the vertices of G into r sets 
there are at least rth"^ edges of F' which are non-crossing. Let Vi U . . . U 14 be a partition of V{G) 
that maximizes the number of crossing edges and let again ti be the number of edges of F' inside Vi 
(note that in this case the sets Vi are not necessarily of the same size) . Using Claim 18. H together 
with the fact that ti > rlh'^ and the Cauchy-Schwartz inequality, we conclude that 
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j^eam > ^((l-^)^ + /.t.-0(mV/2)) 
i=i i=i ^ ^ 

2 

This completes the proof of the claim. ■ 

We are now ready to complete the proof of Theorem ll.3[ Choose the constant c to be sufficiently 
large so that 2/(c + 1) < min((5, 7, 1/4). Recall, that as we chose b = and n = Q{m^^^), we have 



n 



2-5 = 0(62), n^-i = o{b''), m^n'/^ = o{b^). (14) 



Also, as G has minimum degree (1 — fi)n we get from Theorem 16. H that 

E'h{G) > BUG) - 0{n'-^). (15) 



As H does not satisfy V we clearly have E!p{G) > E'^{G). Combining this with ([T3]), ([HD and ([E 
we get 



E't,{G)>E'h{G) > E',{G)-0{ 



2-7^ 



n 

> (1 _ ^) ^ + ^^^52 _ Q (^2^3/2^ _ Q (^2-7) 

2 

> (1-^)— + /ur^62_o/^2N 

2r 

Furthermore, by our choice of r, we get that any r-colorable graph satisfies hence we infer from 
(dSD and (mi) that 

2 

^^p(G)<^;(G) < (l-^)|^ + ^r£62 + 0(mV/2) 

< {I - ^)— + ^rlb^ + oib^) . 

2r ' 

We thus conclude that \E'j,{G) — ((1 — /i)^ + iirl}p')\ < o{b'^). Therefore, if one can approximate 
E!p{G) in time polynomial in n (and hence also in m) within an additive error of 'n?~^ = o{b'^) 
then one thus efficiently computes an integer L, which is within an additive error of 0(6^) from 
(1 — /^)^ + fJ-rib'^. But as in this case £ is precisely the nearest integer to (L — (1 — ^)^)/ firb"^ , 
this implies that we can precisely compute the number of edge removals, needed in order to turn 
the input graph F into an r-partite graph. This implies that the problem of approximating E'^{G) 
within 71,2"'^ is A^P-hard, and completes the proof of Theorem 11.31 
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Concluding Remarks and Open Problems 



We have shown that for any monotone graph property V and any e > one can approximate 
efficiently the minimum number of edges that have to be deleted from an n-vertex input graph 
to get a graph that satisfies V, up to an additive error of ev?. Moreover, for any dense monotone 
property, that is, a property for which there are graphs on n vertices with n{'n?) edges that 
satisfy it, it is A^P-hard to approximate this minimum up to an additive error of n?~^ . It will 
be interesting to obtain similar sharp results for the case of sparse monotone properties. In 
some of these cases (like the property of containing no cycle, or the property of containing no 
vertex of degree at least 2) the above minimum can be computed precisely in polynomial time, 
and in some other cases, a few of which are treated in [12], [E], [l6], a precise computation is 
known to be hard. Obtaining sharp estimates for the best approximation achievable efficiently 
seems difficult. 

As we have mentioned in Section [H a special case of Theorem 11.31 implies that for any non- 
bipartite H, computing the smallest number of edge removals that are needed to make a graph 
iJ-free is NP-\iaid. This is clearly not the case for some bipartite graphs such as a single edge 
or any star. It will be interesting to classify the bipartite graphs for which this problem is 
iVP-hard. 

It seems interesting to decide if one can obtain a result analogous to Theorem ll.3l for the family 
of hereditary properties. 

A weaker version of Theorem 11.11 can be derived by combining the results of [7] and |24j . 
However, this only enables one to approximate E-piG) within an additive error e in time nf^^\ 
while the running time of our algorithm is of type f{e)v?. 

Recall that E'yr{G) denotes the smallest number of edge deletions that are needed in order to 
make G J- -free. For a family of graphs J^, let vjr[G) denote the J^-packing number of G, which 
is the size of the largest family of edge-disjoint copies of members of which is spanned by 
G. Let i'^{G) denote the natural Linear Programming relaxation of vjr^G). Haxell and Rodl 
[32] and Yuster [U] have shown that i^j^iG) < v*jr{G) < vjr{G) + e-n? for any JF and any e > 0, 
implying that for any finite , vjr{G) can be approximated within any additive error of en? 
by solving the Linear Program for computing v^{G). One may wonder whether it is possible 
to obtain Theorem 1 1.1 1 by solving the natural Linear Programming relaxation of E'jr{G), which 
we denote by E^(G). Regretfully, this is not the case. Linear Programming duality implies 
that -E^(G) = z^^(G) and by the resuhs of [32] and 07] we thus have 

u^{G) < E*^{G) < u^{G) + en^ . (16) 

Consider now any which does not contain the single edge graph and note that we trivially 
have iijr[Kn) < 5(2) — i"-^ denote by Kn the n-vertex complete graph). If T contains 
a bipartite graph then by the theorem of Kovari, Sos and Turan (see Section [6|) we have 
E'jr{Kn) > (2) - n^""^ > (^ - o{l))r?. If on the other hand ah the graphs in T are of chromatic 
number r > 3 then clearly they all must contain at least (2) edges, and therefore we must have 
ujr(Kn) < (2)7(2) — r{r-i) ' othcr hand, by the theorem of Erdos-Stone-Simonovits 

(see Section [6]) E'jr{Kn) > 2{r-i) ~ o(?^^)- In any case, we have that ujr[Kn) + 5n? < E'jr{Kn) 
for some fixed 5 = 6{J-) > 0. Combined with (jl6p we get that for any not containing the 
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single edge graph E^{Kn) + Sn? < E'jr{Kn)- Thus, the (trivial) case in which contains a 
single edge is the only one for which computing E'^{G) is guaranteed to approximate E'^{G) 
within en^ for any e > 0. In fact, in this degenerate case we actually have E*j^{G) = E'-piG). 
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